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(57) Abstract 

There is provided an expression control polynucleotide of an invertase gene, which may be operably limited to a heterologous 
polynucleotide. Optionally the expression control polynucleotide and heterologous polynucleotide construct is transfected into host cells or 
organisms. Preferably the construct is used to produce a transgenic plant and the expression control polynucleotide is pollen cell-specific. 
A suitable expression control polynucleotide is as set out in SEQ ID No 1. especially nucleotides 3430-5349 thereof. Desirably the protein 
expressed by the heterologous polynucleotide causes male sterility in the transgenic plants. 
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""Expz-ession Con1:rol Polynucleo'tldes*' 

This invention relates to the fields of plant 
biotechnology and plant genetic engineering. In 
particular it relates to transgenic plant production 
and tissue-specific expression of introduced gene 
sequences in pollen cells. 

A promoter is a non-coding nucleotide sequence which 
controls the transcription of an adjacent nucleotide 
sequence. A number of promoters have been isolated 
from a wide variety of sources, including plants. In 
certain applications it is desirable to genetically 
engineer a construct which comprises a promoter 
operatively linked to a heterologous nucleotide 
sequence such that the promoter controls expression of 
the heterologous sequence in the host cell transformed 
with that construct. Where the promoter is only active 
in particular tissue types expression of the 
heterologous sequence is restricted accordingly and 
this may be especially desirable in some circumstances. 

A number of plant-derived promoters have been isolated 
which activate expression of their companion nucleotide 
sequences only in pollen cells. Use of these pollen 
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1 cell-specific promoters to activate genes encoding 

2 heterologous proteins has also been described [see CA 

3 2021643] and may lead to the production of proteins not 

4 normally present in pollen cells . Such an approach may 

5 allow the expression of heterologous genes which encode 

6 for proteins able to render the plant male-sterile by 

7 ablation of pollen cells (for example if the proteins 

8 are toxic to the pollen cell) or to drive the 

9 production of antisense RNAs which interfere with the 

10 normal processes of pollen cell metabolism. Pollen 

11 cell-specific promoters can further be used to drive 

12 expression of proteins that are toxic to insects or 

13 other pests which consume pollen. These promoters can 

14 also be used to activate the expression of genes 

15 encoding proteins which will enhance the nutritional 

16 value of pollen. 
1? 

18 However, the niimber of pollen cell-specific promoters 

19 which have been well characterised is limited and 

20 different promoters exhibit a range of activities which 

21 cannot be predicted a priori and are difficult to 

22 quantify. The activity of a promoter isolated from one 

23 species of plant may also differ when the promoter is 

24 utilised in an heterologous species - such differences 

25 may be both in the tissue specificity and strength of 

26 the promoter and are more likely to occur with greater 

27 taxonomic distance between plant species. In addition 

28 different promoters may be required to control 

29 expression of multiple genes since a gene silencing 

30 effect can occur if duplicate copies of the same 

31 promoter are used. The choice of piromoter is therefore 

32 limited and has to be experimentally verified in the 

33 system under study. 
34 

35 According to the present invention there is provided an 

36 invertase gene expression control polynucleotide, a 
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1 derivative, a functional equivalent ^ or a part thereof, 

2 which is pollen cell-specific. 
3 

4 By "pollen cell-specific" we mean that the expression 

5 control polynucleotide exhibits a distinct level of 

6 activity (or lack of activity) in pollen cells (ie in 

7 material ranging from developing pollen grain through 

8 to material derived from pollen) compared to the other 

9 tissue types of the transformed plant. 
10 

11 By "expression control polynucleotide" we mean any 

12 polynucleotide which is capable of affecting the 

13 expression of a gene. The term is intended to include 

14 promoters, enhancers and suppressors. 
15 

16 By "functional equivalent" we mean any variation of the 

17 expression control polynucleotide which exhibits 

18 substantially the same functional properties of the 

19 original polynucleotide- 
20 

21 By "derivative" we mean a modified version of the 

22 expression control polynucleotide which exhibits 

23 substantial sequence homology to the original 

24 polynucleotide, for example which include nucleotide 

25 substitutions which have no effect on biological 

26 function. 
27 

28 By "part" we mean a deleted version of the expression 

29 control polynucleotide, which comprises at least a 

30 substantial portion of the original polynucleotide (for 

31 example at least 50% of said polynucleotide). 
32 

33 The preferred type of expression control polynucleotide 

34 is a promoter. 
35 

36 The invertase gene promoter is preferably derived from 
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4 

a dicotyledon, such as potato. 

The expression control polynucleotide of the invention 
may comprise double- or single- stranded DNA or RNA. 

The invention also provides the use of the expression 
control polynucleotide described above to control 
expression of heterologous sequences. Optionally the 
expression control polynucleotide is used to drive 
pollen cell-specific expression of protein-encoding 
heterologous genes in plants eg monocotyledons or 
dicotyledons. Use of the expression control 
polynucleotide in this way in dicotyledons is 
preferred. 

The invention also provides a recombinant expression 
control polynucleotide comprising at least a part of a 
pollen cell-specific expression control polynucleotide 
as described above. The recombinant expression control 
polynucleotide of the invention is capable of specific 
expression of a heterologous sequence in pollen cells. 
The heterologous sequence expressed may encode a 
protein. Alternatively RNA sequences which do not code 
for protein (eg ribosomal RNA or anti-sense RNA) may 
instead be transcribed from the heterologous sequence. 

The invention also provides a polynucleotide having the 
sequence set out in SEQ ID No 1, including derivatives, 
functional equivalents or parts thereof. The preferred 
polynucleotide is that shown in SEQ ID No 1 from 
nucleotides 3144-5396 and more preferably from 
nucleotides 3430-5349. The most preferred 
polynucleotide is the promoter in the 3430-5349 bp 
fragment . 

A deposit of genetic material containing the 



wo 98/41643 PCT/GB98/00833 

5 

1 polynucleotide of SEQ ID No 1 was made at the National 

2 Collection of Type Cultures on 7 February 1997 under No 

3 NCTC 13013- 
4 

5 The present invention also provides a recombinant 

6 nucleotide construct comprising an expression control 

7 polynucleotide according to the invention operably 

8 linked to a heterologous (preferably protein-encoding) 

9 polynucleotide. 
10 

11 Thus, activation of the expression control 

12 polynucleotide may drive the expression of the 

13 heterologous polynucleotide, enabling production of the 

14 encoded protein- Since the expression control 

15 polynucleotide is tissue-specific, production of the 

16 protein will be limited to those tissues where the 

17 expression control polynucleotide is active. 
18 

19 The present invention also provides a recombinant 

20 vector containing an expression control polynucleotide 

21 or a recombinant nucleotide construct as defined above. 
22 

23 According to the present invention there is also 

24 provided a method of producing a recombinant vector, 

25 said method comprising ligating an expression control 

26 polynucleotide as described above into a suitable 

27 vector. A method of producing a transformed cell by 

28 trans feet ing a host cell using said recombinant vector 

29 forms another aspect of the invention. Suitable 

30 vectors and genetic modifications thereof are well- 

31 known in the art. 
32 

33. The present invention also provides a transformed host 

34 cell containing a recombinant nucleotide construct or 

35 vector as defined above. 
36 
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The present invention also provides a transgenic 
organism ( for exeunple a transgenic plant ) containing a 
recombinant nucleotide construct or a vector as defined 
above. The progeny (and seeds) of such transgenic 
organisms forms a further part of the invention. 

The present invention also provides a method for 
controlling the expression of a protein, said method 
comprising operably linking a polynucleotide sequence 
encoding said protein to an expression control 
polynucleotide of the invention. The method is 
especially useful for the expression of proteins in 
pollen. Preferably the protein expressed leads to 
sterility of the transformed plant. 

Thus the invention also provides a method of 
controlling the expression of a heterologous 
polynucleotide in pollen, said method comprising 
operably linking said heterologous polynucleotide to an 
expression control polynucleotide of the invention. 

In one ^nbodiment the promoter for the invertase gene 
of potato is expressed specifically in pollen to 
activate expression of any DNA sequences in the pollen 
of transgenic plants . Below we describe the isolation 
and characterisation of this promoter and how it has 
been used to express genes in pollen. 

The present invention will now be further described 
with reference to the Example and accompanying Figures 
in which: 

Figure I<egends 

Figure 1. Map of sequences detailed in text with 

restriction enzymes used in their cloning. 
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1 Figure 2. whole anther from transgenic potato plant 

2 stained for GUS activity (GUS activity 

3 indicated by the dark areas ) . 
4 

5 Figure 3, cross-section of anther as in Figure. 2 

6 showing staining in individual pollen grains 

7 (pollen grains appear as dark spots). 
8 

9 Figure 4. RT-PCR analysis showing a product of 374 bp 

10 indicating expression from the promoter in 

11 (a) floral and bud tissue^ and (b) in excised 

12 anthers but not in the remainder of the 

13 floral tissue. 
14 

15 Example 

16 A potato {Solanum tuberosum I». ) cv. Saturna genomic 

17 library, consisting of a partial Sau3AI digest of 

18 genomic DNA cloned into AEMBL3, was plated to yield 1 x 

19 10^ pfu which were screened with a radiolabelled carrot 

20 invertase cDNA fragment generated by reverse 

21 transcription-polymerase chain reaction (RT-PCR) using 

22 primers derived from a sequence of carrot cDNA (Sturm 

23 and Chrispeels, 1990). 
24 

25 The primers were: 

26 Forward Primer: 5 ' -AACGATCCAAATGGACCA-3 ' (SEQ ID No 2) 

27 Reverse Primer: 5 ' -GAAAAAATCAGGACATTCCCA-3 ' (SEQ ID 

28 No 3) . 
29 

30 Hybridisation conditions of 5 x SSC at 65**C were 

31 utilised with subsequent low stringency washing of 

32 filters in 2 x SSC at 65°C. After three rounds of 

33 screening two positive clones were obtained plaque 

34 pure. DNA was purified from one positive clone , AGF5, 

35 which was shown to contain an insert of approximately 

36 23 kb of potato DNA. This cloned potato DNA was 
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digested with Xbal and Sail, and fragments cloned into 
pUC19. One subclone, named pGF521, contained 5.4 kb of 
the potato DNA. A complete DNA sequence of this 
fragment is presented (SEQ ID No 1). It was 
determined, by homology to known invertase gene 
sequences, that the 5.4 kb of potato DNA (Figure 1) 
carried sequence of two invertase genes with the 
intergenic region constituting the promoter of the 
downstream gene. A 2.25 kb Hindlll-Xbal fragment (bp 
3144-596; Figure 1) comprising the promoter, 3' end of 
the upstream gene and 5' end of the downstream gene was 
subcloned into pUC19 to yield plasmid pGF5211 
(deposited as NCTC 13013) . This fragment was also 
cloned into pBllOl.3 to give plasmid pRMll.2 which was 
used as a vector for stable plant transformation. In 
pRMll.2 the fragment is fused to the uldA gene from 
Escherichia coli and when the promoter is active in 
plants would drive the transcription of this gene to 
produce the bacteria enzyme /^-glucuronidase (GUS). An 
internal AccI fragment of 1.9 kb (bp 3430-5349; Figure 
1) derived from the 2.2 kb fragment was also cloned 
into pBIlOl.3 to generate plasmid pRM12.3 which was 
also used as a vector for stable plant transformation. 
This fragment was also fused to the uidA gene to drive 
/^-glucuronidase synthesis when active. 

A series of transgenic lines of potato (cv. Desiree) 
plants were generated by Agrobacterium turns faci&ns-- 
mediated transformation using pRH12.3 and pRMll.2 as a 
vector. Plants derived from the use of pRM12.3 as a 
vector were passed through one cycle of tuberisation 
then grown in a controlled environment until flowering 
occurred. The floral tissues including anthers, 
sepals, petals and ovules were separately analysed by a 
GUS histochemical assay performed at two pH values: pH 
5 to assay for endogenous enzyme activity (the control) 
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1 and pH 7 to detect the activity derived from the uidA 

2 ene activated by the invertase promoter. A strong blue 

3 staining, detected only at pH 7 and thus indicative of 

4 bacterial GUS derived from expression of uidA driven by 

5 the invertase promoter, was observed only in pollen 

6 cells {see dark areas of Figures 2 and 3) and in no 

7 other tissues of the flower or elsewhere throughout the 

8 plant, while in control untrans formed plants only a 

9 light background of blue staining was observed in 

10 pollen cells. Prior to the GUS histochemical analysis 

11 an analysis using RT-PCR to detect expression from the 

12 native promoter driving its invertase gene had detected 

13 expression only in floral and bud tissue with no 

14 expression observed in source and sink leaf (Figure 

15 4a), stem, root or tuber. A subsequent RT-PCR analysis 

16 detected expression only in the pollen-containing 

17 anthers and not elsewhere throughout the flower (Figure 

18 4b) . We conclude that the activity of this promoter is 

19 restricted to pollen. 
20 

21 The recombinant DNA procedure utilised were as 

22 described by Sambrook et al (1989). Plant tissue 

23 culture and transformation protocols were as detailed 

24 by Hedley (1995). Histochemical assay of GUS was 

25 performed as indicated by Jefferson (1987). 
26 

27 The invention describes a promoter sequence 

28 demonstrated to be active specifically in pollen of 

29 Solarium tuberosum. This promoter is likely to be 

30 active in pollen of other species of the Solanceae, and 

31 may be active in pollen of other plant species 

32 including those in which the production of male sterile 

33 plants for hybrid production is important eg 

34 Lycopersicon esculentum and the Brassicaceae . It is a 

35 unique sequence described with this activity in Solanum 

36 tuberosum, and for other plants it provides an 
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alternative to the use of pervious ly isolated 
promoters • It has the advantage of its own 
characteristic activity profile and when used in 
heterologous species may escape problems such as gene 
silencing, which can compromise the use of homologous 
promoters. It has potential use in the genetic 
engineering of male sterile plants and lines for the 
restoration of fertility, for the production of 
proteins in pollen which are toxic to insect and other 
pests, or for the production of protein of enhanced 
nutritional value in pollen. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICA1^IT: 

(A) NAME: SCOTTISH CROP RESEARCH INSTITUTE 

(B) STREET: INVERGOWRIE 

(C) CITY: DUNDEE 

(E) COUNTRY: UK 

(F) POSTAL CODE (ZIP): DD2 5DA 

(G) TELEPHONE: 01382 562731 

(H) TELEFAX: 01382 562426 

(ii) TITLE OF INVENTION: EXPRESSION CONTROL POLYNUCLEOTIDES 
(iii) NUMBER OF SEQUENCES: 3 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version ttl.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10811 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: . double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Solanum tuber sum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

AAGCCTAGTT CGACCTGCAG TCAACGGATC TTTATAGCTA CATATATATA AGATTGATCA 60 

TTCTTGATAA GCTGGACGTC AGTTGCCTAG AAATATCGAT GTATATATAT TCTAACTAGT 120 

AAGAACTATT TTCAAAATTA TGTATACATA CACACACATA CATAATTATG TGGTTCATTT 180 

GTGTTAGTTA AAGTTTTAAT ACATATGTAT GTGTGTGTAT GTATTAATAC ACCAAGTAAA 24 0 

CACAATCAAT TCTATTATTC AGTAGTCAGT ATTCATTTTT GAAATGTAAT TAATTT/U^T 300 

TTGTGTCTAA AGATAATAAG TCATCAGTCA TAAGTAAAAA CTTTACATTA ATTAAATTTA 360 

AACACAGATA ATATTCTATT TTGGAGAACA AAATCGCTCA TGATCAACAA TCGATGACTC 420 

AATTTTTAAT TATAAGATAA AACCTCTTGT TTTAGCGAGT ACTAGTTGTT AGCTACTGAG 4 80 

TTAAAAATTA ATTTAAATTC GAAATTAGAT TAATTATTAT GGCAAGACAA TTACAAGGCT 540 

AAGGTTTTGT TAAATTTAAG CTTTAATCTA ATTAAT7ATA CCGTTCTGTT AATGTTCCGA 600 

TTCCAAAACG ATAAGAATGT GCIAAAAGAGA AAAAGAAACA TGAAATATAT GAAAAAGTTC 660 

TTTTAACCTC TATTCTTACA CGTTTTCTCT TTTTCTTTGT ACTTTATATA CTTTTTCAAG 720 

AAAATTGGAA AGATTTTGGC CATGGAATTA AGGTGAAAAT TAATTTGTTG GAGGCACCCT 7 80 
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TTATATTCCT 


TCTAAAACCG 


GTACCTTAAT 


TCCACTTTTA ATTAAACAAC 


CTCCGTGGGA 


840 


AATATAAGGC 


CTTGGCATTT 


TCTTCTCCCT 


TAT AT XTTTT 


ftrwyrn f^rrt 71 71 7\ rp 

CCT 1 CI AAAl 


TATTATTATT 


900 


ATTTTTATTG 


GAACCGTAAA 


AGAAGAGGGA 


ATATAAAAAA 


ggaagatti a 


ATAATAATAA 


960 


TAAAAATTVAA 


TTATTATTAT 


TATTATTT^G 


TGTTGAAATA 


TAGT GACATl 


TCATACATAC 


1020 


TCACATATTT 


AATAATAATA 


ATAATAATTC 


ACAACTTTAT 


AT C ACT GT AA 


AGTATGTATG 


1080 


AGTGTAT/^G 


TGTACATTTA 


ATATGTAGGT 


CTTATATTAA 


TTAAAACTT G 


CCAAACATAT 


1140 


TGTCTCTTAC 


ACATGTAAAT 


TATACATCCA 


GAATATAATT 


7VATTTT GAAC 


GGTTTGTATA 


1200 


ACAGAGAATT 


AAAGTTGGCT 


CCCCCAACCC 


CCAACCTCAC 


cccaccctaa 


AAAAAATTAC 


1260 


TTCATCAATA 


TTTCAACCGA 


GGGGGTTGGG 


GGTTGGAGTG 


GGGTGGGAi 1 


TTTTTTAATG 


1320 


AAGTAGTTAT 


TCGTTTTTTC 


ATGTGACACA 


ATTTTCTTAT 


TTAATTl Gi 1 1 


ATCTA7VAGAA 


1380 


TGACTATTTA 


AGCAAAAAAG 


TACACTGTGT 


TAAAAGAATA 


»V TV *T>«T^ TV TV TV 7V 

AATTAAACAA 


TAGATTTCTT 


1440 


ACTGATAAAT 


ATATATTTAA 


ACCTGTACTT 


ATCATTCTAC 


TV m/^ TV m TV tyi tv A 

AT GAT AT ATA 


TATATATATA 


1500 


TATATATATA 


TATATAAATT 


TGGACATGAA 


TAGTAAGATG 


TACTATATAT 


ATATATATAT 


1560 


ATATATATAA 


TGTATATATA 


TGTGTGTGTG 


TGTTTTTATA 


GTCACAATAA 


ATCTCATGAT 


1620 


ATGTTTCAGT 


ACATATATAT 


ACACACACAC 


ACAAAAATAT 


CAGT GTT ATT 


TAGAGTACTA 


1680 


TACAAAGTCA 


TCATAATTTT 


CAAATATTTT 


TTACTTTATT 


TTTTAAATTA 


CGTATTAAAT 


1740 


CAAATTACGT 


AGTATTAAAA 


GTTTATAAAA 


AATGAAATAA 


AAAATTTAAT 


GCATAATTTA 


1800 


GTTTAATGCC 


CATATAAATT 


AAAATGATTC 


AACTAATTAG 


TCATTTTTGT 


ATTTCCTACA 


1860 


TTTCTGTGTG 


GTATATTTAA 


TTTTACTAAG 


TTGATTAATC 


AGTAAAAACA 


TAAAGGATGT 


1920 


AAAGACACAT 


CACCTTTGAT 


TTGTTAATTA 


TTATAGTATT 


TGATTATTTC 


TTAATCATTG 


1980 


ATTAATTATA 


GTGGAAACTA 


AACAATTAAT 


AATATCATAA 


ACTAATAAAG 


AATTAGT/KAC 


2040 


TAATTAATAG 


TAATGCTTTC 


TATTGTCTTC 


ACTAATCATT 


T CTTTTTTTA 


AAATTCTTTT 


2100 


GTTCTTATAC 


ATTACGAAAG 


ATAACAGAAG 


TGATTAGTAA 


AGAAAAAAAT 


TTTAAGA/^AA 


2160 


CAAGAATATT 


GGTCTCTCTT 


CATTGGATAC 


CTTTCAAATA 


TACA7\AGACC 


CTAATGGTGA 


2220 


GTTAGATTAA 


CCAGAGAGAA 


GTAACCTATG 


GAT^GTTTAT 


ATGTTTCTGG 


GATTACCACT 


2280 


CAATCTAATA 


TTTTTTTTAT 


TTATTCTTCC 


TAAATAAACT 


ATATATACAC 


ACAACTTGTC 


2340 


AATAACTCAT 


AAAAAAAATA 


AATAAGAAGG 


ATTTATTTGA 


TATATATGTG 


TGTTGAACAG 


2400 


TTATTGAGTA 


TTATATTATT 


GACTATCATT 


TTATTTTTCA 


ACAGCACCAA 


TGTATTATAA 


2460 


TGGAGTATAT 


AATATAATAA 


CTGATAGTAA 


AATAAAAAGT 


TGTCGTGGTT 


ACATAATATT 


2520 


ACCTCATATT 


CATTTATTCT 


ATCAATACAA 


TCCAAAAGGA 


TCAGTATGGG 


GCAATATTGT 


2580 


TTGGGCTCAA 


GTAAATAAGA 


TAGTTATGTT 


AGGTTTTCCT 


AGTCATACCC 


CGTTATAACA 


2640 


AACCCGAGTT 


TCAGTCTCAA 


/JlAGACTTGAT 


AAATTGGATC 


CATTTAGAAC 


CCGCAATTTA 


2700 


TCCATCTAAA 


AGTCAGAGTT 


TTCTGAACTA 


TTTAACCTAG 


GTAAATCTTG 


GGCGTTAAAT 


2760 
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AGGTAGATTA AAATTTGACA AATATGGTGC TTGGTCCGGG TCAGCT^ACTA TTCTACCAAA 
TAACAAGCCT TTTAAACTGT TTATACCACG AACCAGGCCC AGTCGTTGAT AAGATGGTTT 
ATTGTTCGGC GTTATCTTAT ACACCGGAGT AGTAGATTCC CATGATTCTC AAGTTCAAAA 
TTATGCAATG CAATAGAATA TGTGGCCTCA TCATCTAAGG GTACTT^GAG TTCAAGTTTT 
AATACGTTAC CCGGCTAACT TGTCTGATCC ATTTCTTCGT AAATGGATCA AACCTAATAA 
C7\ACCCGTTG GGCCGATTGA ACAGACTAGG TAAAGAAGCA TTTACCTAGT TTGGATTATT 
GTTGGGCAAG ATTGTACCTG ACAATAGCAT CAACTWKACC AAATTTCGTG ATCCAACAAC 
CGCATGGATC TAACATGGAC TGTTATCGTA GTTGTTTTGG TTTAAAGCAC TAGGTTGTTG 
GCGTACCTAG GGCCAAGATG GGCTTTGGAG AATTGT7ATA GGAAGTATGA GAAAACATAG 
AGGGATGGCC CCGGTTCTAC CCGAAACCTC TTAACATTAT CCTTCATACT CTTTTGTATC 
TCCCTACCGA TTATTGTATA GAAGTAGAGA CTTCATT7\7VA TGGGCCAAAG CCCAACATCC 
ACTTCATTCT AATAACATAT CTTCATCTCT G/siAGTAATTT ACCCGGTTTC GGGTTGTAGG 
TGAAGTAAGA TCTCCTCATA CTGGAAATTG GGAATGTCCT GATTTTTTTC CTGTATCATT 
AAAAAATACT AGAGGAGTAT GACCTTTAAC CCTTACAGGA CTAAAAAAAG GACATAGTAA 
TTTTTTATGA AATGGCTTAG ATGCATCGTA TCGCGGAAAA AATGTCAAAT ATGTCCTTAA 
GAATAGCCTT TTACCGAATC TACGTAGCAT AGCGCCTTTT TTACAGTTTA TACAGGAATT 
CTTATCGGAT GATGTTAATA GGTTTGAGTA TTACACTATT GGTATGTATG ACACCAAAAA 
AGATAGGTAA CTACAATTAT CCAAACTCAT T^TGTGATAA CCATACATAC TGTGGTTTTT 
TCTATCCATC ATTCCTGATA ACAATTCTAT CGATGGTTCG AAGGGATTGA GGCTTGACTA 
TGGGAATTTG T/^GGACTAT TGTTAAGATA GCTACCAAGC TTCCCTAACT CCGAACTGAT 
ACCCTTAAAC TATGCATCTA AATCATTCTA TGACCCTATG AAGAATCGAA GAATTGTATG 
GGGTTGGACG ATACGTAGAT TTAGTAAGAT ACTGGGATAC TTCTTAGCTT CTTAACATAC 
CCCAACCTGA AATGAATCAG ATGTTTTACC TGACGATGAA ATTAAGAAAG GATGGGCTGG 
AATTCAAGCT TTACTTAGTC TACAAAATGG ACTGCTACTT TAATTCTTTC CTACCCGACC 
TTAAGTTCGT ATTCCGCGTA AAGTATGGCT CGACCCTAGT GGTAAACAAC TGATTCAATG 
GCCTATTGAA TT^GGCGCAT TTCATACCGA GCTGGGATCA CCATTTGTTG ACT7UVGTTAC 
CGGATAACTA GAATTAGAAA CATTAAGAAA GCAAAAGATT ATTCAATTGA ACAACAAGAA 
GTTGAGCAAT CTTAATCTTT GTAATTCTTT CGTTTTCTAA TAAGTTAACT TGTTGTTCTT 
CAACTCGTTG GGAGAAATGT TTGAAGTTAA AGGAATCTCA GCATCACAGG TTTTAACTTT 
TCCTTATTAC CCTCTTTACA AACTTCAATT TCCTTAGAGT CGTAGTGTCC AAAATTGAAA 
AGGAATAATA ACTATAGTCT TTTAAATATC ACAACAAAAA GTAATTTTAA TTAGTATACA 
ATGTATAATT TGATATCAGA AAATTTATAG TGTTGTTTTT CATTAAAATT AATCATATGT 
TACATATTAC AATGTATAAC TATTATATCA ATTGCATATG ATAGGTCTAT ATATAGTAGA 
TTAATTATTG TTACATATTG ATAATATAGT TAACGTATAC TATCCAGATA TATATCATCT 
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AATTAATAAT 


ATTCACTTGT 


TATATGTATA 


TCGGTGATAA 


TTATAAATTG 


TTGCAATTGA 


4860 


ACGAGTGAAA 


TAAGTGAACA ATATACATAT AGCCACTATT 


AATATTTAAC 


AACGTTAACT 


4920 


TGCTCACTTA 


TATTTTTGGT 


TAGTCGGGAA AAA7\ATTTGA 


CTAGGGCTAG 


GGCTAAATAA 


4980 


ACTTGTGCGT 


ATAAAAACCA ATCAGCCCTT 


TTTTT7KAACT 


GATCCCGATC 


CCGATTTATT 


5040 


TGT^CACGCT 


l/\oL. 1 1 L. 1 1 V- 


TCATCTTACA 


TTTCTTTGGT 


TGCAGGCTGA 


TATTGAAGTG 


5100 


TCGTTCTCTA 




AGTAGAATGT AAAGAAACCA ACGTCCGACT 


ATAACTTCAC 


5160 


AGCAAGAGAT 


111 w\r\vj 1 1 i. 


GAACAAGGCC 


GAACAATTTG 


ATCCTAAATG 


GGCCGACCTT 


5220 


TATGCCCA/^ 




TTTGTTCCGG 


CTTGTTAAAC 


TAGGATTTAC 


CCGGCTGGAA 


5280 


ATACGGGTTG 


A T* ^ T T* T" T r* 


\_/A J. y-\/TA\v5w vj A 


TCGACTATCC 


AAGGTGGGCT 


TGGACCATTT 


5340 


GGGCTTGTGC 




rZT ATTTPCPA 
o 1 /\ 1 a 1 w w \^J^ 


AGCTGATAGG 


TTCCACCCGA 


ACCTGGTAAA 


5400 


CCCGAACACA 


?~'A'T"T'7yi^(^'T"Pf 
1 A^J^ 1 1 


TA A A AAPTTf^ 


GAAGAATACA 


CACCTGTTTT 


CTTTCGAGTG 


5460 


TTTAAGGCTT 


Kj 1 AA i L-oAA^j 


a TTT'P'P A A r" 
AX i 1 i 1 o/a/-\v-. 


CTTCTTATGT 


GTGGACAAAA 


GAAAGCTCAC 


5520 


AAATTCCGAC 


AAAAwAAl iA 


T a A rz fZTT PT P 


ATGTGCTCAG 


ATGCTAGAAG 


GTTTGTTTCT 


5580 


TCAATTG7\AG 


TTTTCl lAAl 


ATTPP A AriAG 
Al J. 


TACACGAGTC 


TACGATCTTC 


CAAACAAAGA 


5640 


AGTTTSJVCTTT 


TAGT 1 1 AA 1 


P ATP ATAflTT 
oA 1 A X r\o X X 


TACATAGTTA ACTCAATATA 


GGAGCACAAA 


5700 


ATTTGAGTAA 


ATCAACAl lA 


L- X A<J 1 Ax %-AA 


ATGTATCAAT 


TGAGTTATAT 


CCTCGTGTTT 


5760 


TAAACTCATA 


ATCAAGAATT 


Al AAl VjAK-L.^ 


GACTTTGATA 


TCATGATAAG 


AAAT AC AT CT 


5820 


ACTTATCGAT 


TAGTTCTTAA 


lAl iALrlooo 


CTGAAACTAT 


AGTACTATTC 


TTTATGTAGA 


5880 


TGAATAGCTT 


T GTTTATT AG 


m rT% TV rpm TV A A 

T GT C AT X AAA 


AAACTCTAAC 


CTTGTTTAGT 


TTPTTTATTA 

X X ^>» X X X /AX X 


5940 


ATGAGCAGAA 


ACAAATAATC 


AC AGTAAT TT 


TTTGAGATTG 


GAACAAATCA 


AAGAAATAAT 


6000 


TACTCGTCTT 


CTACCATGAG 


AC7\AAATGAA 


GCAATGTACA AGCCCTCATT 




6060 


GTAGATGTAA 


GAT G GT ACT C 


TGTTTTACTT 


CGTTACATGT 


TCGGGAGTAA 


APGACCTATA 


6120 


CATCTACATG 


ATTTAGTAGA 


CACGAAGAAG 


TTATCTCTTA 


GGAGTTTGGT 


AAGTTTGCTT 

rvws XXX X X 


6180 


TCATCATTTC 


TAAATCATCT 


GTGCTTCTTC 


AATAGAGAAT 


CCTCAAACCA 


TT CAAAC GAA 


6240 


AGTAGTAAAT 


TTTATTTTTT 


TATAATTTAT 


TTGATCAAGC 


TTTTAAGATT 


G C ATT ATTTT 


6300 


GAAGAGTAAA 


APJ\TPiPJ\PJ\A 


ATATTAAATA 


AACTAGTTCG 


AAAATTCTAA 


CGTAATAAAA 


6360 


CTTCTCATTC 


GATTTGTGTT 


TGACTAATCA 


GTTTGTATCA 


TATGTATATT 


TTTCAGATTG 


6420 


ATAACTCAGG 


CTAAACACAA 


ACTGATTAGT 


CT^AACATAGT 


ATACATATT^ 


AAAGTCTAAC 


6480 


TATTGAGTCT 


AGTGGAGAGT 


TTTGGTGCTG 


GTGGAAAAAC 


ATGCATAACA 


TCGAGGGTGT 


6540 


ATCCAACGTA 


TCACCTCTCA 


AAACCACGAC 


CACCTTTTTG 


TACGTATTGT 


AGCTCCCACA 


6600 


TAGGTTGCAT 


AGCGATTCAT 


AAT7VATGCAC 


ATTTATTTGT 


CTTCAATAAT 


GGATCTGAGA 


6660 


CAATCACAAA 


. TCGCTAAGTA 


TTATTACGTG 


TAAATAAACA 


GAAGTTATTA 


CCTAGACTCT 


6720 


GTTAGTGTTT 


TGAGACTCTT 


AATGCTTGGA 


, GCATGGATGT 


ACCTAAGATG 


CACTAAATAT 


6780 
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TTTTTTTTAA 


ACTCTGAGAA 


TTACGAACCT 


CGTACCTACA 


TGGATTCTAC 


GTGATTTATA 


6840 


AAAAAAAATA 


AAAGGAATTA 


TGTCTACAAC 


CATATATGTC 


TAAAGAGACA 


AAAATTGTGT 


6900 


CAAATTTAAT 


TTTCCTTAAT 


ACAGATGTTG 


GTATATACAG 


ATTTCTCTGT 


TTTTAACACA 


6960 


GTTTAAATTC 


AGTAGATGAT 


GTCCTCAAGA 


ATCCTCTATA 


ATTGTCTCTT 


AATTTATTTT 


7020 


GGTGAATTTG 


TCATCTACTA 


CAGGAGTTCT 


TAGGAGATAT 


TAACAGAGAA 


TTAAATAAAA 


7080 


CCACTTAAAA 


GAAGGCAAAG 


AGTGTGTATA 


TGGCTTGTCT 


AGTACCATAT 


ATATACTAAG 


7140 


AAAGAAATTT 


CTTCCGTTTC 


TCACACATAT 


ACCGAACAGA 


TCATGGTATA 


TATATGATTC 


7200 


TTTCTTTAAT 


GTTAGCTTTC 


CTTTTTGTTT 


TGTTACACAA 


TCAAATGTGT 


GGTCTTATGT 


7260 


AGAACTAATA 


CAATCGAAAG 


GAAAAACAAA 


ACAATGTGTT 


AGTTTACACA 


CCAGAATACA 


7320 


TCTTGATTAA 


TTTGGTAATA 


TTAGGCAAGT 


TGTTATGTGA 


CTTATTTTAT 


TCAAATATAA 


7380 


TAAGAAGTTT 


AAACCATTAT 


7VATCCGTTCA 


AC/KATACACT 


GAATT^AAATA 


AGTTTATATT 


7440 


ATTCTTCAAC 


7U\AGAGAAGA 


GTACAAGTAA 


GTAGAGGCGA 


ATTCAGGATT 


TGAAGAGTTT 


7500 


GGGTGCATCG 


TTTCTCTTCT 


CATGTTCATT 


CATCTCCGCT 


TAAGTCCTAA 


ACTTCTCAAA 


7560 


CCCACGTAGA 


ATATTATTTT 


AATTTAAGCA 


TTAGTAAAAA 


TATTAATGAC 


GGCTAACGAA 


7620 


TT^TGTCATT 


TAT/iATTVAAA 


TTAAATTCGT 


AATCATTTTT 


ATAATTACTG 


CCGATTGCTT 


7680 


ATTACAGTAA 


TTTAATAGCA 


AATAAACTCT 


TTAACTTGTA 


ATAAAATGAG 


TAAAAAATGG 


7740 


GTTGTTGGCT 


AAATTATCGT 


TTATTTGAGA 


AATTGAACAT 


TATTTTACTC 


ATTTTTTACC 


7800 


CAACAACCGC 


GAAATCGACT 


CTAGGTCGTA 


GGGGTGGACT 


CCCTTACTTC 


CAACAACGTA 


7860 


CTAGATCAAG 


CTTTAGCTGA 


■ GATCCAGCAT 


CCCCACCTGA 


GGGAATGAAG 


GTTGTTGCAT 


7920 


GATCTAGTTT 


GCCTTCTAAA 


GGTGCACTCT 


TTTTTATATT 


CTCTATTTTT 


ACTAGTTTCT 


7980 


CAAGATACTA 


CGGT^AGATTT 


CCACGTGAGA 


AAAAATATAA 


GAGATAAAAA 


TGATCAAAGA 


8040 


GTTCTATGAT 


GTAGATCCGC 


CTCTGCAAGT 


AAGCGGAAGG 


ACAAATTTTT 


ACCTTACTAA 


8100 


CTCTATTGAA 


CATCTAGGCG 


GAGACGTTCA 


TTCGCCTTCC 


TGTTTAAAAA 


TGGAATGATT 


8160 


GAGATAACTG 


GCACCAAATC 


CCTATTAACA 


AGCATGAAAA 


TATGAGAAGA 


CGAAAGAACT 


8220 


7VAGTAAGTTC 


CGTGGTTTAG 


GGATAATTGT 


TCGTACTTTT 


ATACTCTTCT 


GCTTTCTTGA 


8280 


TTCATTCAAA 


TAATTATTGT 


ATAATTTATA 


ACACACTCTA 


GGATAATATT 


ACAAATAAGA 


8340 


ATAGTGAATT 


ATTAATAACA 


TATTAAATAT 


TGTGTGAGAT 


CCTATTATAA 


TGTTTATTCT 


8400 


TATCACTTAA 


CTTAATTAAT 


GACGAACTAT 


AAAAGCAAAG 


AAGGAAGAGG 


AGTGATGAGC 


8460 


CTCGGAAAAT 


GAATTAATTA 


CTGCTTGATA 


TTTTCGTTTC 


TTCCTTCTCC 


TCACTACTCG 


8520 


GAGCCTTTTA 


AAAAATCACA 


AATGAATAAA 


GATGGCTAAA 


ATAGAGTAAC 


TTGATGAAGT 


8580 


TGATACTTCT 


TTTTTAGTGT 


TTACTTATTT 


CTACCGATTT 


TATCTCATTG 


AACTACTTCA 


8640 


ACTATGAAGC 


ATTCATAATC 


AAATCTGTTC 


AAAAACACTT 


GATAGTTAAT 


TTTTAATTTT 


8700 


AAGAGATGCG 


TAAGTATTAG 


TTTAGACAAG 


TTTTTGTGAA 


CTATCAATTA 


AAAATTAAAA 


8760 


TTCTCTACGA 


CAATATCGTT 


TCTTACTATT 


CTTTTAGGGC 


TTTTCGCCCT 


AGGGATATAG 


8820 
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TATATATGAT GTTATAGCAA AGAATGAT7\A GAAAATCCCG AAAAGCGGGA TCCCTATATC 
ATATATACTC CAAATTTCAC GCCATATATA AATAATGTGG AAAAAACTTA TAATTTTCAA 
GGTAACAAGG GTTTAAAGTG CGGTATATAT TTATTACACC TTTTTTGAAT ATTAAAAGTT 
CCATTGTTCA AATCATATTT TTGTTACGCC TTCGTGGAGA CTACTTCCTC AATCGTAACA 
AAGAACAACT TTAGTATAAA AACAATGCGG AAGCACCTCT GATGAAGGAG TTAGCATTGT 
TTCTTGTTGA TTTCTVAGTGG CGACTCAAAA AAGTCGCCAC GACCGGTGAT AACCTTTTGT 
GGCGATTATT AAAGTTCACC GCTGAGTTTT TTCAGCGGTG CTGGCCACTA TTGGAAAACA 
CCGCTAATAA TTTAGTCGCC ACTAAACATT ATTTGTGATC ATATTTTCTT TTCTTTTTCA 
T^'YrTTTA'nr AAATCAGCGG TGATTTGTAA T7VAACACTAG TATAAAAGAA AAGAAAAAGT 
AAAAAATAAT AAGGTCAT^AT TTACCCCTTT ATCTTATTTA TGTAAATTGG AAAATCCCAA 
ATTTTGCATA TTCCAGTTTA AATGGGGAAA TAG7UVTAAAT ACATTTAACC TTTTAGGGTT 
TAAAACGTAT ATTTTTCGAA TTCCTTTTTT TTTAACACAC TCAAAAGTCA AAACATTAAA 
GAACGGAATA TAAPJ\AGCTT AAGGAAAAAA AAATTGTGTG AGTTTTCAGT TTTGTAATTT 
CTTGCCTTAA GCAAATTAAA TGGCAAAAGA CTTGTTCTAA CAAAAAATTA TTAGTAAAAC 
AGACTCATGT CGTTTAATTT ACCGTTTTCT GAACAAGATT GTTTTTTAAT AATCATTTTG 
TCTGAGTACA AATGTAACAA TAACCAACAA ATTATCCAAA ATTTTAGATA AATATTATCC 
AAACAAATAT TTACATTGTT ATTGGTTGTT TAATAGGTTT TAAAATCTAT TTATAATAGG 
TTTGTTTATA AAAATAATCA TCCAATCCAT TTAATTTATT TTTAAAAAAA ATCCTAAATT 
AACTCTCCAT TTTTATTAGT AGGTTAGGTA AATTAAATAA AAATTTTTTT TAGGATTTAA 
TTGAGAGGTT CTTTCACTCA AAAACAAACT CTACCCATTT TTTTCACTAT AAATACTCTT 
CATAATTTTA GAAAGTGAGT TTTTGTTTGA GATGGGTAAA AAAAGTGATA TTTATGAGAA 
GTATTAAAAC ATTTATTCTT CATTCCCAAG TTTCTTTCTC CCTATCCAAA AAAAATAAAA 
AAAAATTATG TAAATAAGAA GTAAGGGTTC AT^GAAAGAG GGATAGGTTT TTTTTATTTT 
TTTTTAATAT TAGATTAATT ACCACTATCT GTCAAAGCCC AATTATTGAA AAAA7U\AAAA 
CTATGGATTA ATCTAATTT^ TGGTGATAGA CAGTTTCGGG TTAATAACTT tTTTTTTTTT 
GATACCTAAA TTCATCTAAT TCTCGTTGGG CTTTGCCAGT TATCTTAGTT TGCTTTTTTA 
TAGTTTTATT AAGTAGATTA AGAGCAACCC GTW^lCGGTCA ATAGAATCAA ACGAAAAAAT 
ATCAAAATAT ATCCAATAAT GTTGTTTTTG CTTCTCACAA AGTTTTTATT CATTTGCAAT 
CTCAAAATGA TAGGTTATTA CAACAAAAAC GAAGAGTGTT TCAAAAATAA GTAAACGTTA 
GAGTTTTACC TGTAAATGTT CAAACTGTTC ATAGAACTGG TTACCATTTT CAGCCAGAAA 
AACATTGGAG ACATTTACAA GTTTGACAAG TATCTTGACC AATGGTAAAA GTCGGTCTTT 
TTGTAACCTT CAATGGTATG TATACGCGCT TTTTCGCGTT TTTTCTTCTT TATATATGTG 
TCAAAAAAAA GTTACCATAC ATATGCGCGA AAAAGCGCAA AAT^AGAAGAA ATATATACAC 
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AGTTTTTTTT A 

<2) INFOFIMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Forward primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AACGATCCAA ATGGACCA 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRAJ>IDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; other nucleic acid 

(A) DESCRIPTION: /desc = "Reverse primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAAAAAATCA GGACATTCCC A 
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1 
2 
3 
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20 
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24 
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28 
29 
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31 
32 
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34 
35 
36 



1. A pollen cell-specific expression control 
polynucleotide of an invertase gene, or a 
derivative, functional equivalent or part of said 
expression control polynucleotide. 

2. An expression control polynucleotide as claimed in 
Claim 1 which is a promoter. 

3. An expression control polynucleotide as claimed in 
either one of Claims 1 and 2 which comprises a 
sequence substantially as set out in SEQ ID No 1 
or as present in NCTC Deposit No 13013, or a 
functional equivalent or part thereof. 

4. An expression control polynucleotide as claimed in 
Claim 3 which comprises the sequence of 
nucleotides 3430-5349 of SEQ ID No 1. 

5. A recombinant expression control polynucleotide 
comprising at least a part of a polynucleotide as 
.claimed in any one of Claims 1 to 4 . 

6. A recombinant nucleotide construct which comprises 
an expression control polynucleotide as claimed in 
any one of Claims 1 to 5 operably linked to a 
heterologous polynucleotide. 

7. A construct as claimed in Claim 6 which is in the 
form of a vector. 

8. A construct as claimed in either one of Claims 6 
and 7 wherein said heterologous polynucleotide 
encodes a protein. 
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1 9. A host cell transformed with a construct as 

2 claimed in any one of Claims 6 to 8 . 
3 

4 10. A transgenic organism transformed with a construct 

5 as claimed in any one of Claims 6 to 8. 
6 

7 11. An organism as claimed in Claim 10 which is a 

8 plant • 
9 

10 12. An organism as claimed in Claim 11 wherein said 

11 expression control polynucleotide is pollen cell- 

12 specific and the heterologous polynucleotide 

13 operably linked thereto encodes for a protein 

14 which causes male sterility of said plant. 
15 

16 
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Fig. 3 
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Fig. 4b 
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